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Abstract 

Background: Error-prone PCR (epPCR) libraries are one of tine tools used in directed evolution. The Gateway®* 
technology allows constructing epPCR libraries virtually devoid of any background {i.e., of insert-free plasmid), but 
requires two steps: the BP and the LR reactions and the associated £ coli cell transformations and plasmid 
purifications. 

Results: We describe a method for making epPCR libraries in Gateway® plasmids using an LR reaction without 
intermediate BP reaction. We also describe a BP-free and LR-free sub-cloning method for in-frame transferring the 
coding sequence of selected clones from the plasmid used to screen the library to another one devoid of tag 
used for screening (such as the green fluorescent protein). We report preliminary results of a directed evolution 
program using this method. 

Conclusions: The one-step method enables producing epPCR libraries of as high complexity and quality as does 
the regular, two-step, protocol for half the amount of work. In addition, it contributes to preserve the original 
complexity of the epPCR product. 
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Background 

Gateway® is an appealing technology because its cloning 
efficiency is close to 100% [1]. This feature is particu- 
larly welcome when dealing with numerous target genes, 
for instance in Structural Genomics. Unfortunately, high 
throughput gene expression following gene cloning in 
Structural Genomics programs has also revealed that 
many recombinant proteins are insoluble in E. coli 
thereby precluding their crystallization and their study 
by X-ray crystallography. Among the different techni- 
ques used to overcome this insolubility problem, one is 
directed evolution. The use of directed evolution for 
improving recombinant protein solubility can be sum- 
marised as follows. A random library of mutants gener- 
ated by error-prone PCR (epPCR) and/or DNA shuffling 
[2] is screened for variant proteins more soluble than 
the wild-type (wt) protein. To that end, the mutated 
DNA sequences may be expressed as fusion proteins 
with a C-terminal "solubility reporter" such as the green 
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fluorescent protein (GFP) [3]. To assess the solubility 
gain provided by the mutations, the mutated coding 
sequences are then sub-cloned from the solubility repor- 
ter expression plasmid to a GFP-free expression plasmid 
and the solubility of the tag-free variant is compared to 
that of the tag-free wt protein expressed under the same 
conditions. 

Although the Gateway technology is less used in 
directed evolution than in Structural Genomics pro- 
grams, it has been nevertheless successfully applied in a 
directed evolution study that made use of both epPCR 
and DNA shuffling [4]. The evolved Tobacco Etch Virus 
(TEV) protease exhibited significantly higher solubility 
than the vwfTEV protease. Incidentally, this study also 
revealed a few weak points that seemed to be specifically 
associated with the use of the Gateway® technology 
rather than with the screening process or the protein to 
evolve. In particular, (i) the number of expression clones 
was found to be relatively small, as also reported in 
another study [5]; (ii) the generation of epPCR and 
DNA shuffling libraries was labor intensive because of 
the need for BP and LR reactions to be carried out, and 
of the corollary transformations and intermediate 
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plasmid medium preparations [6]; (Hi) the subcloning of 
the coding sequence of selected mutants from the repor- 
ter expression plasmid to a non-reporter expression 
plasmid was also time-consuming because of the same 
requirements. 

While the first of the drawbacks listed above can be 
easily addressed by transforming expression cells by 
electroporation, addressing the other two requires devis- 
ing a novel cloning and sub-cloning strategy. With the 
specific purpose of overcoming these limitations while 
maintaining the obvious advantages of the Gateway'* 
technology, we devised a method that allows eliminating 
the BP step from the generation of the library and both 
the BP and LR steps from the sub-cloning process. We 
applied this method to generate a diversity library of the 
intrinsically disordered C-terminal domain of the 
measles virus nucleoprotein (Ntail) [7,8] as a first step 
towards the dissection of the molecular mechanisms 
underlying its interaction with the C-terminal X domain 
(XD, aa 459-507) of the viral phosphoprotein [9-17]. A 
split-GFP reassembly assay [18-20] was used to screen 
the library and to identify clones with novel binding 
properties. 

Results 

1) Generation of an epPCR library 

The conventional procedure for generating epPCR 
libraries using the Gateway"' technology comprises two 
recombination reactions (BP and LR) [4]. We first 
addressed the question as to whether each recombina- 
tion reaction and associated E. coli cell transformation 
decreased the complexity of a given library. A typical 
Gateway® recombination reaction can be described as 
the transfer of an insert from a donor to a non-recom- 
bined acceptor to yield a recombined acceptor. There- 
fore, the library complexity loss can be evaluated by 
comparing the number of colonies provided by: (i) a 
theoretical experiment made of a 100% efficient LR 
reaction {i.e. a reaction where all the non-recombined 
acceptor {i.e., Gateway" plasmid before LR reaction) 
molecules are used to yield recombined acceptors {i.e., 
Gateway® plasmids after LR reaction)) followed by a 
100% efficient cell transformation {i.e. a transformation 
where all recombined acceptor molecules are uptaken 
by cells and where each cell uptakes one recombined 
acceptor molecule); (ii) an actual cell transformation by 
a recombined acceptor; (Hi) an actual cell transforma- 
tion by an actual LR reaction using the same donor con- 
struct {i.e., the other substrate of the LR reaction) and 
the same non-recombined acceptor as in the previous 
two instances. The results of this comparison are 
reported in Table 1. Since 25 fmoles of acceptor corre- 
spond to 1.55 X 10 molecules, if the LR reaction and 
cell transformation were each 100% efficient, then one 



Table 1 Assessment of the efficiency of E.coli 



transformation by different DNA species 



Experiment 


Transforming DNA 


Number of clones 


Theoretical 


LR reaction 


1.55 X 10^° 


Electroporation 


Recombined acceptor^ 


18.7 (± 7.2) X 10' 




LR reaction^ 


2.7 (± 1 .5) X 1 0' 


Heat shock 


Recombined acceptor^ 


10.8 (± 2.1) X 10^ 




LR reaction^ 


1 .8 (± 0.4) X 1 0^ 



The number of colonies obtained after £ coli transformation by a recombined 
acceptor\ or by an LR reaction^ is reported. T7pRos E. coli cells were either 
electroporated as described In "Methods", or transformed by heat-shock. 
Transformed cells were selected on ACplates. The results are the mean value 
and standard deviation of three independent transformations using the same 
LR reaction 

^25 fmoles of pNGG-NjAiL (Table 2). ^using 25 fmoles of non recombined 
acceptor plasmid (pNGG, Table 1) and 100 fmoles of linear Ntail coding 
sequence flanked by affL recombination sites (Figure 1, stage 1, right panel) 

should obtain 1.55 x 10^° colonies per 25 fmoles of 
input acceptor. However, transforming E. coli cells by 
electroporation with 25 fmoles of recombined acceptor 
provided a mean value of 18.7 x 10^ colonies. Assuming 
that each cell uptakes only one plasmid molecule, this 
means that E. coli cells electroporation was responsible 
for a ~80-fold drop of the theoretical (maximal) number 
of clones in this experiment. In addition, when E. coli 
cells were electroporated with an LR reaction mixture 
using the same amount of the same acceptor, the aver- 
age number of colonies was 2.7 x 10 (Table 1). This 
means that the recombination reaction was by itself 
responsible for an additional ~7 fold efficiency drop. 
Incidentally, this latter comparison provided a direct 
measure of the efficiency of the LR reaction, which 
could be determined to be approximately 14% under the 
conditions used in this experiment. In conclusion, each 
step made of a recombination reaction followed by 
transformation of E. coli by electroporation reduced the 
number of clones with respect to what could be 
expected from a 100% efficient LR reaction combined 
with a 100% efficient cell transformation. Thus, skipping 
one recombination reaction and the associated E. coli 
transformation in the Gateway® cloning process would 
be expected to better preserve the original library com- 
plexity than the classical two step approach (BP followed 
by LR), particularly when heat-shock is used instead of 
electroporation to transform E. coli cells (Table 1). Inci- 
dentally, this would also reduce the risk of biasing the 
pENTR library {i.e., the BP reaction product, see stage 1 
in the left panel of Figure 1) by unbalanced clone grow- 
ing during the culture in the presence of kanamycin. On 
the basis of this observation, we devised a new "single 
recombination" method for constructing epPCR 
libraries. 

The strategy we have devised is described in Figure 1 
(right flowchart) and compared to the regular strategy 
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BP 
reaction 



Cloning of wiid-type sequence to evoive 




Stage 2 




Sub-cloning mutant sequence from 
reporter to non-reporter expression plasmid 




- Comparison of wiid-type and mutated proteins - 



Standard strategy 



Strategy described in this study 



Figure 1 Overview of the method. The left flowchart is the 
standard strategy and the right flowchart is the strategy described 
in this study Brackets on the left indicate the two stages of the 
strategy: the epPCR library construction (stagel) and the sub- 
cloning of mutant inserts from reporter to non-reporter expression 
plasmid (stage 2). ER1 and ER2 denote restriction sites used to clone 
the sequence to evolve and create the PCR template of the 
standard strategy. Inner arrows with continuous lines are the core of 
the method. Outer arrows with dashed lines indicate the pathways 
used to transfer the wt sequence to the non-reporter expression 
plasmid, and (in the right flowchart only) to create the internally 
deleted wfNjAiL in pDEST170/l. IVlutated and wt sequences are 
represented by thick dark and light grey lines, respectively. The 
internally deleted wt sequence is denoted by an asterisk Antibiotic 
resistance markers are indicated: A, ampicillin resistance; K, 
kanamycin resistance. 



(left flowchart and reference [4]). It includes not only 
the generation of the library (Figure 1, stage 1), but also 
the sub-cloning of selected mutant sequences from the 
expression plasmid used for screening the random 
mutant library (referred to as "reporter expression plas- 
mid" throughout the text) to a different expression plas- 
mid (referred to as "non-reporter expression plasmid") 
(Figure 1, stage 2) allowing the variant proteins of inter- 
est to be expressed without the screening tag (such as 
GFP). In the regular strategy [4], generating and then 
cloning an epPCR library into a Gateway" reporter 
expression plasmid requires three steps (Figure 1, stage 
1, left flowchart). In the first step, the epPCR product 
(dark grey in Figure 1, stage 1) is generated using two 
primers hybridizing to the 5' and 3' ends of the coding 
sequence desired to be subjected to epPCR and flanked 
at their respective 5' end by Gateway" attBl and attB2 
recombination sites (25-mer), and a PCR template con- 
sisting of the wt target sequence (light-grey in Figure 1, 
stage 1). In the second step, the epPCR product is 
inserted in a shuttle plasmid (pDONR201) by mean of a 
BP reaction. After E. coli transformation, an aliquot is 
spread on kanamycin plates to assess the library com- 
plexity, and the remaining is grown in liquid culture 
(100 ml). A plasmid medium preparation is then per- 
formed to recover the cloned library (pENTR library) 
from the liquid culture. In the third step, an aliquot of 
the plasmid medium preparation is used in an LR reac- 
tion to transfer the inserts from the entry clones of the 
pENTR library to a reporter expression plasmid. The LR 
reaction is performed as described above for the BP 
reaction except that ampicillin is used instead of kana- 
mycin. In this procedure, obtaining a high-complexity 
library relies on the efficiency of two critical steps, 
namely the BP and LR reactions. In the strategy we 
developed, the epPCR library is generated using the 
atfLl (25-mer) and attL2 (24-mer) primers and the wt 
coding sequence already cloned in the pDONR201 plas- 
mid as PCR template. Primers attlA and attL2 respec- 
tively hybridize to the "Forward- and Reverse-priming 
sites" located upstream and downstream the attLl (100 
bp) and attL2 (100 bp) Gateway® recombination sites 
(Additional file 1: Figure SI). As a result, the epPCR 
product is flanked by full-length attLl and attL2 recom- 
bination sites and hence can be directly used in the LR 
reaction. This procedure enables to eliminate three steps 
of the regular approach: (i) the BP reaction that trans- 
fers the epPCR product to pDONR to create the 
pENTR library; (ii) the transformation of E. coli with the 
pENTR library; (Hi) the purification (plasmid medium 
preparation) of the latter. Beyond the advantage related 
to the reduction of the number of steps, this procedure 
also offers the advantage of preserving the library com- 
plexity as shown above. 
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Table 2 Plasmid constructs used in this study 



Name 


Insert 


Source 


Purpose 


pETlla-link- 
NGFP 


None 


[7] 


pNGG bacl<bone 


pNGG 


Gateway cassette 


This 
study 


Library screening 


pNGb-NjAiL 


M 

Ntail 


1 MIS 

study 


Positive control 


pNGb-btop-NjAiL 


StOp-NjAIL 


This 
study 


Negative control 


pUUINn-otOp- 

Ntail 


Stop- Ntail 


1 nis 
study 


LR substrate 


pULJINn-INjAIL 


I^TAIL 


Thic 

1 nis 

study 


epPCR substrate 


^ [ — ri 1 ^ V N 1^ rn 

pti 1 la-Z-NGhK 


Leucine zipper 


[7] 


Positive contro 


pMRBAD-Z-CGFP 


Leucine zipper 


[7] 


Positive control 


pMRBAD-linl<- 
CGFP 


None 


[7] 


CGFP cloning 
backbone 


pMRBAD-XD- 
CGFP 


XD 


This 
study 


Ntail interacting 
partner 


pDEST170/l- 

Ntail 


Ntail 


This 
study 


idNTAiL Pl^R template 


pDESTI 70/1- 

idNjAIL 


Internally deleted 
Ntail 


This 
study 


PCR2 template 



2) Sub-cloning of selected mutated sequences 

After mutants of interest have been selected, their cod- 
ing sequence must generally be sub-cloned from the 
reporter expression plasmid used to screen the epPCR 
library to another non-reporter expression plasmid (Fig- 
ure 1, stage 2). In the regular strategy [4], this procedure 
essentially follows the same process as that used to cre- 
ate the random library (compare stage 1 and stage 2 of 
Figure 1, left flowchart). In the first step, the mutated 
coding sequence is transferred from the reporter expres- 
sion plasmid to pDONR201 by mean of a BP reaction. 
E. coli cells are transformed with the BP reaction mix- 
ture, and transformed cells are selected on kanamycin 
plates. Given the efficiency of Gateway* transfer [1], few 
colonies need to be analyzed. The plasmid of each col- 
ony is purified by mini-preparation, and then checked 
for the presence of the proper insert generally by PGR 
using the attLl and attL2 primers. The plasmid of one 
positive pENTR clone is then used as substrate in an LR 
reaction to transfer the coding sequence from pDONR 
to the non-reporter expression plasmid. The regular 
strategy works well when few mutants have to be pro- 
cessed, but we realized that it was impractical in direc- 
ted evolution projects where many mutated coding 
sequences have to be sub-cloned in parallel from repor- 
ter to non-reporter expression plasmid(s). Therefore, we 
devised the alternative strategy depicted in the right 
flowchart of Figure 1, stage 2 and in more details in Fig- 
ure 2. This strategy could be described as a hybrid 
method between MEGAWHOP [21] and RF cloning 



[22]. MEGAWHOP was devised to clone a random 
library of linear DNA. In practice, a library of mutated 
linear DNA is used as a complementary pair of megapri- 
mers in a PGR experiment. The PGR template is the 
non mutated DNA sequence borne by the expression 
plasmid to be used for screening the library. Since the 
hybridization mismatch between the wt sequence borne 
by the expression plasmid and each mutated sequence 
to sub-clone (i.e., the mutation rate) is low compared to 
the homology, the megaprimers hybridize very efficiently 
to the wt sequence. During the PGR elongation steps, 
the whole plasmid is copied. After amplification, the 
PGR template is degraded by Dpnl treatment. The final 
product is the random library of DNA borne by the 
expression plasmid. RF cloning also aims at inserting a 
linear double strand DNA into a plasmid, but uses a dif- 
ferent approach. In contrast to MEGAWHOP, not the 
whole length of the linear DNA to clone is used for 
hybridization, but only 24 base pairs at both ends which 
are designed to be complementary to the cloning site of 
the plasmid. When this linear DNA is used in a PGR 
experiment with the cloning plasmid as template, each 
24 base flanking extension hybridizes to its complemen- 
tary sequence on the plasmid, resulting in a linear 
amplification of the plasmid during the PGR elongation 
step. After amplification the PGR template is degraded 
by Dpnl treatment. The final product is the linear DNA 
borne by the plasmid. Our sub-cloning technique relies 
on the use of megaprimers, as in MEGAWHOP, and on 
the annealing of only the ends of the megaprimers, as in 
RF cloning. The method consists of two steps. In the 
first step, the mutated coding sequence is PGR amplified 
using the attBl and attB2 primers and the reporter 
expression plasmid bearing the mutated coding 
sequence as template (PGRl, Figures 1 and 2). After 
Dpnl treatment to remove methylated {i.e. parental) 
DNA, the PGR product flanked by full-length attBl and 
attB2 Gateway® recombination sites is used as a pair of 
complementary megaprimers in a second PGR step that 
uses an internally deleted form of the wt coding 
sequence borne by the non-reporter expression plasmid 
as template (PGR2, Figures 1 and 2). The reason for 
using an internally deleted sequence as PGR2 template 
is explained in the last paragraph of the results section. 
The result of this second PGR is the full-length mutated 
coding sequence in the non-reporter expression plasmid. 

3) Application of the method to the Njail-XD 
interaction project 

We have applied the above-described strategy to few 
directed evolution projects (B. Goutard, F. Vincent, 
unpublished data), including the one reported herein. In 
view of gaining insights into the Ntail'XD interaction, 
we generated a mutant library of Ntail and used a split 
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fattB^ attB2^ 

Mutated 
sequence 
in reporter 
expression 
plasmid 
pNGG 



PCR1 using affB primers 
followed by Dpnl treatment 



affBI Mutated sequence attB2 



PCR2 using above PCR product as 
megaprimers, and internally deleted 
wild-type sequence in non-reporter 
expression plasmid as template... 

\/ 






Figure 2 Schematic of the sub-cloning of a mutated sequence from reporter to non-reporter expression plasmid. From top to bottom, 
the DNA fragments amplified by PCR between the T7prom and attB2 primers are 1,046 bp, 333 bp, and 560 bp in lengtli, respectively. On the 
right is illustrated how inverse PCR was used to internally delete 227 bp from the wtNyAiL sequence in pDEST170/l. Plasmids are not at scale. 
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GFP re-assembly assay to screen variants with altered 
interaction abilities [18-20]. In this method, the 
sequence coding for one of the two interacting partners 
under study is inserted in the prokaryotic expression 
vector pETlla-link-NGFP so as to lead to its expression 
as a fusion protein with the N-terminal half of GFP 
(NGFP). The coding sequence of the second interacting 
partner is inserted in another prokaryotic expression 
vector (pMRBAD-link-CGFP) thus leading to its expres- 
sion as a fusion protein with the C-terminal half of GFP 
(CGFP). Both fusion proteins are then co-expressed in 
E. coli. If the two proteins of interest interact with each 
other in the bacterium, their interaction allows the two 
GFP halves to re-associate and reconstitute the func- 
tional {i.e., fluorescent) GFP. Thus, fluorescent bacteria 
denote an interaction between the two partners, with 
the fluorescence intensity being proportional to the affi- 
nity of this interaction. In practice, a library of Ntail 
random mutants was inserted in a modified pETlla- 
link-NGFP vector (see below), and mutants were 
screened for altered interacting abilities with wfXD 
expressed from pMRBAD-XD-CGFP. 

Since neither pETlla-Unk-NGFP nor pMRBAD-link- 
CGFP are Gateway® plasmids, as a first step in this pro- 
ject we modified pETlla-link-NGFP to make it a Gate- 
way* reporter expression plasmid. This conversion is 
generally performed by inserting a blunt-end synthetic 
cassette that is flanked by attRl and attR2 recombina- 
tion sites, and bears two constitutive genes: a chloram- 
phenicol resistance (Cm"^) gene, and a counter-selectable 
(ccdB) gene. The cassette can be purchased from Invi- 
trogen in three reading frames, and is intended to be 
inserted in a blunt (or blunted) restriction site of the 
plasmid to modify. However, such a restriction site may 
not exist at the desired position, and blunt end cloning 
requires screening not only for cassette-containing plas- 
mids but also for the correct orientation of the cassette 
with regard to the promoter. Since pETlla-link-NGFP 
cloning site contains two restriction sites (Xhol and 
BamHI), an alternative to the classical approach was to 
PGR amplify the cassette using whatever Gateway*" des- 
tination plasmid as template and primers hybridizing to 
the 5' and 3' ends of the cassette and flanked by Xhol 
and BamHI restriction sites, respectively. Unfortunately, 
because of the high homology between the 5' and 3' 
ends of the Gateway® cassette (Figure 3A), each primer 
was found to be able to hybridize to both ends of the 
cassette (Figure 3B). To circumvent this problem, we 
devised the strategy depicted in Figure 3C. To prevent 
hybridization of the 5' primer with the 3'primer site and 
vice versa, the 5' and 3' halves of the cassette were PGR 
amplified separately (PGR tube 1 and PGR tube 2, Fig- 
ure 3G) using the Gateway" destination plasmid pTH31 
[4] as template. This template contained a single BamHI 



restriction site in the middle of the cassette which had 
to be mutated before cloning. We took advantage of its 
central location to develop a pair of internal overlapping 
primers (primers 2 and 3) encoding a mutated BamHI 
site to mutate the internal BamHI restriction site during 
the amplification of the two cassette halves. Internal 
PGR primers 2 and 3 were 100% complementary to 
each other, so that the PGR products from tube 1 and 
tube 2 overlapped. After Dpnl treatment to remove the 
template, and purification of the PGR products to 
remove PGR primers, a third PGR was run that only 
used an equimolar amount of the two PGR products, i. 
e., without adding external primers nor template (PGR 
tube 3, Figure 3C). This "elongation PGR" reconstituted 
the full-length cassette which was then ligated into 
pETlla-link-NGFP after BamHI and Xhol digestion. 
This Gateway® reporter expression plasmid was called 
pNGG (glasmid N-GFP Gateway ) and is available upon 
request. Incidentally, this "two halves" strategy is gener- 
ally applicable whenever direct directional cloning is 
hampered by a high homology of the two PGR primers. 

In a first step, the efficiency of LR reactions using dif- 
ferent donors and acceptors was evaluated under differ- 
ent experimental conditions. We first assessed the 
efficiency of an LR reaction that made use of a non- 
mutated linear molecular species bearing attL recombi- 
nation sites at the extremities (wiNxAiL'PCR, which 
could be described as a "ghost pENTR clone" since it is 
devoid of most of the pENTR backbone) and a circular 
Gateway® reporter expression plasmid (non-linearized 
pNGG), and compared it to that of LR reactions that 
were carried out with different combinations of linear 
and non-linear substrates. The results are reported in 
Table 4. The combination used in the strategy we 
devised, i.e. wiNxAiL-PCR and non-linearized pNGG 
(Table 4, line 3), provided at least as good results as the 
other combinations, in particular when compared to the 
reference wiNxAiL-pDONR/non-linearized pNGG com- 
bination (Table 4, line 1). Thus, a linear DNA flanked 
by full-length attL recombination sites proved to be an 
efficient substrate for the LR reaction. We also checked 
whether the use of non-equimolar ratios of either LR 
reaction substrates, as suggested by Invitrogen, was opti- 
mal. The results, reported in Table 3, indicated that 100 
fmoles of the donor construct and 25 fmoles of the 
acceptor construct provided at least as good results as 
an equimolar amount (100 fmoles of each), suggesting 
that 25 fmoles of non-linearized pNGG did not limit the 
LR reaction efficiency under these experimental condi- 
tions. The strategy we devised relies on the use of a pair 
of generic attL primers to generate an epPGR product 
flanked by the full-length attLl and attL2 recombina- 
tion sites (Stage 1 in the right flowchart of Figure 1). 
Since each attL recombination site is 100 bp long 
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\ 5 ' end ACAAGTTTGTACAAAAAAGCTGAA 

3 ' end CCACTTTGTACAAGAAAGCTGAA 

Xhol - at tRl GGGGCTCGAGTACAAGTTTGTACAAAAAAGCTGAA 

BamHI - a t tR2 - GGGGGGATCCACCACTTTGTACAAGAAAGCT 

Identity * * ********* ****** 



M 1 2 3 




PGR tube 1 



BamHI 



(1) 



/ \ 

#affR1 attR2m 
1 1 pTH31 1 



PCR1 followed by Dpnl digestion 



Xhol 



affR1 



Mutated 
BamHI 



BamHI 



PGR tube 2 




'affR1 affR2^ 

PTH31 11(4) 



PGR2 followed by Dpnl digestion 



Mutated 
BamHI 



attR2 



BamHI 



PGR tube 3 



PGR using only purified PGR1 and 



Xliol 



affR1 



Mutated 
BamHI 



affR2 



PGR2 products 



BamHI 




Ligation after 



Xliol and 
BamHI 
digestion 




T7P 



T7P 



Figure 3 Construction of pNGG. (A) Alignment of (from top to bottom): the 5' and 3' ends of pTH31 Gateway cassette; Xhol-affRl PGR primer; 
BamHI-atTR2 PGR primer. Sequence identity is denoted by asterisl<s below the alignment. (B) The Gateway® cassette was PGR amplified using 
pTHBl as template, and either primer Xhol-offRl alone (lane 1), primer BamHI-arTR2 alone (lane 2), or primers Xhol-affRl and BamHI-affR2 (lane 3). 
Markers size is indicated on the left in base pairs. (C) Two-halves" making of pNGG. The plasmids are not at scale. Light grey, Gateway cassette. 
Black, offR recombination sites. Primer (1), Xhol-attRl. Primer (2), BamHI-mut-R. Primer (3), BamHI-mut-F. Primer (4), BamHI-atTR2 (Table 3). 
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(Additional file 1: Figure SI), a risk potentially existed 
that mutations had been introduced by Mutazyme II in 
these sites during epPCR elongation steps. This would 
lead to an epPCR product partly unsuitable for the LR 
reaction, thereby ultimately resulting in a library com- 
plexity drop. As one can see in the first two lines of 
Table 5, 100 and even 10 fmoles of NxAiL'CpPCR pro- 
duct provided at least as many clones as did 100 fmoles 
of NxAiL-PCR product generated by a proof-reading Taq 
polymerase (Table 4 third line, and Table 3 first line). 
This suggests that if mutations occurred in the attL 
recombination sites during the epPCR, they did not 
interfere with the downstream LR reaction. This result 
is not completely surprising since it has been shown 
that deleting as much as 50% of attL recombinant site 
did not significantly reduce the percentage of recombi- 
nant clones [23]. Finally, we investigated the reproduci- 
bility of our approach. As shown in Table 5 (last three 
lines), the results obtained with Nxail/pNGG were not 
unique to this system, as comparable results were 
obtained when coding sequences with a composition 
and length (LadS, PA3059) different from that of Ntail 
were used in LR reactions with another reporter expres- 
sion plasmid (pTH31). This suggested that using a linear 
DNA generated by epPCR flanked by attL sites, and a 
non-linearized plasmid in the LR reaction was a gener- 
ally applicable method. 

In the next step, we evaluated the ability of our 
method for generating epPCR libraries (Stage 1 in the 
right flowchart of Figure 1) to provide mutants of inter- 
est. To that end, an epPCR library of Ntail in pNGG 
(Table 5, second line) was screened for clones displaying 
higher or lower fluorescence with respect to that of 
wiNxAiL when co-expressed with XD as described in 
Methods. More than 300 clones complying with these 
criteria were manually selected and characterized in 
terms of their fluorescence and of their sequence (Gruet 
et al., unpublished results). The latter revealed an aver- 
age mutation rate of 1% base pairs. Attempts to increase 
this rate by performing additional epPCRs failed for the 
reasons explained in Additional file 2: Figure S2. Figure 
4A shows the fluorescence of a representative sample of 
the selected clones. Antiparallel leucine zippers (Z) 



Table 3 Effect of the molar ratios of iv^Ntall-PCR (donor) 
and non-linearized pNGG (acceptor construct). 





Donor 


Acceptor construct 


Number of clones 


Electroporation 


100 fmoles 


25 fmoles 


0.95 X 1 




TOO fmoles 


100 fmoles 


0.4 X 10' 


Heat shock 


100 fmoles 


25 fmoles 


047 X 1 0^ 




100 fmoles 


100 fmoles 


0.16 X 10= 



The number of £ coli colonies provided by using two molar ratios of donor 
and acceptor construct in a 5 fjl LR reaction is reported. T7pRos f. coli cells 
were treated as described in Table 1 



Table 4 Comparison of expression library complexities 
provided by different combinations of linear and non- 
linear LR substrates using i^tN-pAiL sequence. 



Donor 


Acceptor construct 


Number of clones 


wfNTAiL-pDONR' 


Non-linearized pNGG 


0.3 X 10' 


wfNTAiL-pDONR^ 


Sall-linearized pNGG 


04 X 10' 


wrNjAiL-PCR^ 


Non-linearized pNGG 


1.6 X 10' 


WfNjAIL-PCR^ 


Sall-linearized pNGG 


0.25 X 10' 



The number of E. coli colonies provided by various combinations of 100 
fmoles of donor and of 25 fmoles of acceptor construct In a 5 pi LR reaction 
is reported. T7pRos E coli cells were electroporated and spread on ACplates 
VfNjAiL coding sequence borne by pDONR. ^tvfNjAiL coding sequence 
amplified using affLI and affL2 primers 



(pETlla-Z-NGFP and pMRBAD-Z-CGFP) were used as 
positive control of interaction [18] because of their high 
affinity, expression, and solubility. An Ntail coding 
sequence with an in-frame stop codon located just 
downstream attBl was used as negative control (Stop- 
NtaiL) S). This construct expresses only the NGFP moi- 
ety which is unable to interact with XD-CGFP. The 
reference fluorescence value was provided by iviNxAiL 
(N). A representative set of 4 mutants (1-4) with a fluor- 
escence similar to or lower than that of wfNxAiL is 
reported. The relationship between the mutations borne 
by these mutants (Additional file 3: Text SI) and their 
specific fluorescence is beyond the scope of this study 
and will be discussed elsewhere (Gruet et al., unpub- 
lished results). Notably, clone 4 featured a stop codon 
(R489 (CGA) > (TGA) Stop) that resulted in a truncated 
form of Ntail- Interestingly, this deletion perfectly 
mimics an already published variant (Ntaila2,3) that had 
been shown to display a considerably lower (two orders 
of magnitude) affinity towards XD [10]. Since we could 
not rule out a priori that a decreased fluorescence could 
be due to decreased protein expression and not to 
decreased Ntail-NGFP/XD-CGFP interaction, we 

Table 5 Comparison of expression library complexities 
provided by different combinations of epPCR products 
and of non-linearized acceptor constructs 



epPCR product Non-linearized acceptor Number of 

construct clones 



NjAiL^ 10 fmoles 


pNGG^ 


2.8 X 10' 


Ntail' 100 fmoles 


pNGG^ 


3.25 X 10' 


LadS^ 10 fmoles 


pTH31 = 


6.65 X 10' 


LadS^ 100 fmoles 


pTH31 = 


3.85 X 10' 


PA3059^ 42 


pTH31 = 


1x10' 


fmoles 







The number of f. coli colonies provided by various combinations of epPCR 
product and of 25 fmoles of acceptor constructs in 5 \i\ of LR reaction is 
reported. T7pRos f. coii cells were electroporated and spread on ACplates 
^Ntail PCR product is 661 bp. ^LadS PCR product is 696 bp. ^PA3059 PCR 
product is 1,780 bp. These three PCR product lengths include the affLI and 
affL2 recombination sites. '^pNGG is 8,012 bp before LR recombination. 
^pTH31 is 8,306 bp before LR recombination. 
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analyzed the total and soluble fractions of the different 
Ntail-NGFP clones (S, N, 1-4). Taking advantage of the 
presence of a 6His tag appended at the N-terminus of 
NGFP [18], the His-tagged proteins expressed by clones 
Z, S, N, and 1-4 were purified by immobilized metal 
(Ni^"^) affinity chromatography (IMAC) from the total E 
coli lysate under denaturing conditions, or from the 
soluble fraction of this latter under non denaturing con- 
ditions. The eluants from IMAC were analyzed by SDS- 
PAGE (Figure 4B). Notably, Stop-NxAiL. i^^Ntail. and 
all four mutants exhibited comparable levels of total 
expression of the NGFP-Ntail fusion protein (arrows 1 
and 2 in Figure 4B) suggesting that fluorescence differ- 
ences were due to different interaction abilities and not 
to different protein expression levels. Moreover, the 
expression level of the NGFP-Nxail protein by the non- 
fluorescent mutant 4 was even slightly higher than that 
of the other mutants (arrow 2 in Figure 4B). Under non 
denaturing conditions, the interacting partner fused to 
CGFP and devoid of His-tag (arrows 5 and 6 in Figure 
4B) was co-purified with the His-tagged protein. As 
expected, its amount paralleled the fluorescence intensi- 
ties reported in Figure 4A. By contrast, when proteins 
were purified by IMAC under denaturing conditions, 
only the His-tagged moiety was recovered from total E 
coli lysate. Altogether, these results indicate that differ- 
ent NxAiL'XD interaction capabilities due to specific 
mutations in the Nxail sequence and not different 
Ntail expression levels accounted for the different 
fluorescence intensities reported in Figure 4A. Thus, our 
one step method for generating epPCR libraries (Stage 1 
in the right flowchart of Figure 1) proves to be an effec- 
tive mean to yield mutants of interest. 
In view of investigating interaction capabilities in the 
absence of the GFP moiety (to be published elsewhere), 
the mutated N-jail coding sequences were sub-cloned 
from pNGG (reporter expression plasmid) to 
pDEST170/I (non-reporter expression plasmid), a 
frame-compatible plasmid allowing proteins to be 
expressed with a simple N-terminal His-tag [24]. Sub- 
cloning was performed as described above (Figure 1, 
stage 2, right panel and Figure 2). The efficiency of this 
approach is highly dependent on the efficiency of Dpnl 
treatment performed after each PCR. An inefficient 
Dpnl treatment after PCRl would generate a back- 
ground made of the donor construct (mutant sequence 
in pNGG). An inefficient Dpnl treatment after PCR2 
would generate a background made of the acceptor con- 
struct (wt sequence in pDEST170/I). Unfortunately, this 
signal-to-noise ratio could be appraised only by sequen- 
cing plasmids borne by several randomly chosen clones. 
To avoid sequencing PCRl or PCR2 background, we 
introduced a screening step so as to be able to distin- 
guish background plasmids from the desired construct. 



The easiest way to do that was to use a pair of screening 
primers that amplify fragments of different lengths from 
the three possible constructs (i.e., the correct construct, 
and PCRl and PCR2 backgrounds). Although a primer 
pair made of T7prom and attB2 primers could distin- 
guish pNGG from pDEST170/I, it failed to differentiate 
a mutated N-xail from the wiNxAiL sequence in 
pDEST170/I, as both have the same length. A solution 
to this issue was to use as template in PCR2 an intern- 
ally deleted wtNxAiL sequence (Figure 2) instead of the 
full-length sequence as envisaged by MEGA WHOP [21]. 
The internally deleted (227 bp deletion) Nxail borne by 
pDEST170/I (pDEST170/I-idNxAiL, Table 2) was con- 
structed by inverse PCR using the pDEST170/I-NxAiL 
plasmid (Table 2) as template and primers pl70/ISallF 
and pl70/ISallR (Table 6). Following Dpnl digestion, 
the PCR product was purified and then digested with 
Sail and self-ligated (Figure 2). Under these conditions, 
PCR screening E. coli colonies after transformation with 
DpnI-treated PCR2 product using T7prom and attB2 
primers was expected to lead to amplification of either a 
1,046 bp band (PCRl background), or a 560 bp band 
(correct construct), or a 333 bp band (PCR2 back- 
ground). Typical PCR screening results are reported in 
Figure 4C. First experiments (experiment I in Figure 4C) 
revealed a low signal (i.e., correct construct) to back- 
ground ratio. In gel I, clone 1 was PCR2 background, 
while clones 2, 3, 6 and 7 were PCRl background and 
only clones 4 and 5 contained the correct construct. 
The high background (5 clones out of 7) denoted ineffi- 
cient Dpnl treatments, particularly after PCRl (4 clones 
out of 7). This issue was addressed in two different 
ways. Firstly, we reduced the background by improving 
Dpnl treatment and by reducing the amount of the tem- 
plate used in PCRl as the PCR template is responsible 
for the background. Secondly, as the signal is propor- 
tional to (and limited by) the amount of megaprimers 
used in PCR2 we increased the signal by increasing the 
amount of megaprimers produced by PCRl. In practice, 
to both increase the signal and decrease the background 
we combined several modifications of the reaction: more 
Dpnl was used for longer incubation times, more of 
each primer, more PCR cycles, larger PCR volumes and 
less PCR template were used in experiments II and III 
(Additional file 4: Table SI) to generate the megapri- 
mers (PCRl). In experiment II, the template used for 
PCR2 was full-length Nxail in pDEST170/I. The new 
experimental conditions (Additional file 4: Table SI) 
completely abolished PCRl background (no band 
detectable at 1,046 bp in Figure 4C, experiment II). 
However, the use of full-length Nxail in pDEST170/I 
as template in PCR2 did not allow detecting PCR2 back- 
ground. Therefore, an experiment was performed as in 
experiment II except that internally deleted Nxail in 
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Figure 4 Representative results of library screening and sub-cloning experiments. (A) The fluorescence to ODeoo ratio (mean value and 
standard deviation of a triplicate experiment) of the clones indicated on the x axis were determined as described in Methods. Z, Leucine 
zippers; S, Stop-NjAiL; N, w/fNTAib 1-3, full length mutated Njaiu 4, truncated Njail mutant. (B) His-tagged proteins expressed by clones Z, S, N, 1-4 
(Figure 4A) were purified by affinity chromatography on IMAC as described in Methods, and were analyzed by SDS-PAGE using 15% 
polyacrylamide gels and Coomassie blue staining. Soluble, His-tagged proteins were purified under non denaturing conditions from the soluble 
fraction of the E. coli lysate. Total, His-tagged proteins were purified under denaturing conditions from total £ coli lysate. Soluble and total 
fractions were obtained from a duplicate culture. M, molecular size markers (from top to bottom: 170, 130, 100, 70, 55, 40, 35, 25, 15, 10 kDa). 
Arrows indicate the different purified proteins: 1, NGFP- wtNjAiL and NGFP-fuli-length Njail variants (34 kDa); 2, NGFP-truncated Njail variant 4 
(294 kDa); 3, NGP-Z (22.8 kDa); 4, NGFP [i.e., Stop-NrAii) (204 kDa); 5, XD-CGFP (15.5 kDa); 6, Z-CGFP (13.3 kDa). (C) PGR screening of mutated 
Njail sub-cloning experiment from pNGG to pDEST170/l. PGR control a, 1,046 bp; b, 550 bp; c, 333 bp. PGR screening II and III were run in the 
same gel along with controls a, b, and c. M, molecular size markers (from top to bottom: 2000, 1500, 1000, 750, 500, 250 bp). 
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Table 6 Primers used in this study 







Pi irnnco 


Ncol-MeV. 

AU 

MeV.XD-Aatll 


GGGGCCATGGGCGCATCACGCAGTGTAATCCGCTCC 


XD PGR amplification 


GGGGGACGTGGACTTGATTArrATClTCACCAGGAT 




NtailF 


GGGGAGAAGTTTGTAGAAAAAAGCAGGCTCTACTACTGAGGAGAAGATGAGTAGA 


NjAiL PGR amplification 


NtailR 


AGCTTTCTTGTACAAAGTGGTGGATCCCCCC 




StopNtail 


GGGGACAAGTTTGTACAAAAAAGCAGGCTCTTAATAAACTACTGAGGAGAAGATCAGTAGA 


StopNtail PGR amplification 


Xhol-otfRI 

(primer 1) 


GGGGCTGGAGTAGAAGTTTGTAGAAAAAAGCTGAA 


pNGG construction (Gateway® cassette 5' 
halve) 


BamHI-mut-R 
(primer 2) 


TCTGGGTTTTAGTAAGCCGGAAGCTCTAGATTACGGCGCGCGCrG 




BamHI-mut-F 
(primer 3) 


LAGGGLGGGGLG I AA 1 L 1 AGAGG 1 1 LLGGL 1 1 AG 1 /\AA/\ijLLA^f\ 


pNGG construction (Gateway'"" cassette 3' 
halve) 


BamHI-affR2 
(primer 4) 


GGGGGGATGCAGCACTTTGTAGAAGAAAGCT 




p170/ISanF 


GGGGGTCGACGCAGGAATCTCGGAAGAACAAGGC 


PGR construction of pDESTl 70/l-idNTAiL 


p170/ISallR 


GGGGGTCGACCGTGTAGAAATGATAGTTGGGC 




T7prom 


TAATACGACTCACTATAGG 


Sub-cloning screening 


attB2 


CCACTTTGTACAAGAAAGCJGGGT 




atfLl 


TGGCGTTAACGCTAGCATGGATCTC 


epPCR 


atrL2 


GTAACATCAGAGATTTrGAGACAC 





pDEST170/I was used as template in PCR2. The results 
(experiment III in Figure 4C) confirmed the absence of 
PCRl background and pointed out a low level (similar 
to that of experiment I although PCR2 template had 
been increased, see Additional file 4: Table SI) of PCR2 
background. PCR-positive clones 1, 2, 3 and 5 from 
experiment III were sequenced, and proved to bear the 
correct insert in the correct plasmid. In addition, 
sequencing ruled out the presence of any unwanted 
mutations that could have been introduced because of 
the increase (from 10 to 30) in the number of cycles in 
PCRl. 

Discussion 

In this study, we described a new method for creating 
high complexity epPCR libraries based on a modified 
use of the Gateway" recombination cloning technology. 
Compared to the conventional Gateway" protocol ([4] 
and stage 1 in the left flowchart of Figure 1), our 
method is faster as it skips some steps such as the BP 
reaction and the associated E. coli transformation and 
plasmid purification. Note that although we only used 
epPCR for generating random mutant libraries, we 
believe that our strategy could be easily applied to the 
generation of libraries based on DNA shuffling [2]. 
Since the conventional sub-cloning of selected mutated 
sequences requires the same steps as those typically 
used to create the library ([4] and stage 2 in the left 
flowchart of Figure 1), we also devised a sub-cloning 



strategy that allows several mutant sequences to be sub- 
cloned in parallel from the reporter expression plasmid 
used to screen the library to another expression plasmid 
devoid of the screening tag (GFP in this case). We have 
applied and validated the method in three directed evo- 
lution projects, and provided here the first results 
obtained in the case of the Ntail'XD system. In the 
other two projects, we constructed other epPCR libraries 
that made use of other target sequences to evolve, as 
well as of another reporter expression plasmid. In all 
cases, comparable library complexities were obtained 
(Table 5 and unpublished data). 

As shown in Table 1, skipping the BP reaction used in 
the standard protocol to clone the epPCR product in 
pDONR (stage 1 in Figure 1) appears to preserve the 
library complexity. However, one may argue that the 
price paid for this preservation is that the wt sequence 
must first be cloned into pDONR, and so the benefit of 
our strategy (i.e. the reduction in the number of steps) 
would be abrogated by this first mandatory "pre-clon- 
ing" BP reaction. We would like to point out that this is 
not the case. This starting construct is instrumental not 
only in the epPCR step but also, directly or indirectly, in 
three other steps of our procedure (see dashed line in 
the right flowchart of Figure 1). Firstly, this pDONR 
construct bearing the wt sequence is required to gener- 
ate by LR reaction the pDESTiyO/I-wtNxAiL construct 
(i.e. the non- reporter expression plasmid bearing the wt 
sequence) that will be used as a control in comparative 
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expression experiments with the selected mutated 
sequences (Figure 1). Secondly, pDEST170/I-H'fN-xAiL 
allows the construction of the internally deleted tem- 
plate used in the sub-cloning (Figure 2). Thirdly, 
pDEST170/I-VFiNxAiL also provides a positive control in 
PGR screenings aimed at avoiding the sequencing of 
background clones (Figure 4C). By contrast, since the 
standard strategy (stage 1 in the left flowchart of Figure 
1) used a non Gateway* plasmid as template for the 
epPCR, the same result will require much more work 
and steps (compare dashed lines in the left and right 
flowcharts of Figure 1). Thus, what could first appear as 
a constraint finally proves to be a saving of time when 
compared to the standard protocol. 

RF cloning and MEGA WHOP could not be used in 
our sub-cloning protocol exactly as they were published, 
and needed to be adapted for the following reasons. In 
the published RF cloning technique [22], PGRl was per- 
formed using genomic DNA as template, and so ineffi- 
cient Dpnl digestion would simply result in bacteria 
being transformed with genomic DNA. Since the latter 
did not provide any antibiotic resistance it could not 
generate any background, whereas the template was 
responsible for PGRl background in our case (Figure 
4G). In the case of MEGAWHOP [21], donor and 
acceptor plasmids were the same. Here again, inefficient 
Dpnl digestion of PCR2 product would only increase 
the amount of non mutated sequences in the final 
epPGR library, a situation frequently encountered even 
at high mutagenesis rates (A. Gruet, unpublished obser- 
vations), whereas it was responsible for PGR2 back- 
ground in our case (Figure 4G). 

It is noteworthy that the sub-cloning method that we 
developed was made possible because of the following 
two features. Firstly, donor (pNGG) and acceptor 
(pDEST170/I) plasmids were both Gateway"" plasmids 
and hence shared common 5' and 3' PGR priming sites. 
Moreover, these PGR priming sites did not require spe- 
cific but generic attBl and attB2 primers. Secondly, 
thanks to the specific founding features of Gateway"', 
pNGG and pDEST170/I were 5' and 3' reading frame 
compatible, a feature that can be easily obtained for any 
expression plasmid suitably modified to bear the Gate- 
way* cassette. For example, we have taken advantage of 
this feature to sub-clone mutated coding sequences 
from pTH31 to pTH24 [4] in another directed evolution 
project (B. Goutard, unpublished results). 

Although it does simplify sub-cloning when more than 
five mutant coding sequences have to be processed in 
parallel, the sub-cloning strategy that we devised could 
be further optimized by using a PGR2 template that 
could be detected directly on plates. This would save 
the time devoted to pick up and grow randomly chosen 
clones and to purify and screen their plasmids so as to 



distinguish them from those from background colonies. 
Actually, we sought at using Rubredoxin [25] as such a 
possible phenotypic marker of PGR2 background. 
Unfortunately however, following transformation of E. 
coli with a pDEST170/I plasmid bearing the sequence 
of a synthetic Thermotoga maritima Rubredoxin gene 
(NGBI reference sequence: NP_228468.1) that had been 
optimized for translation in E. coli, no red colonies were 
obtained (data not shown). Another even more stringent 
mean would be to use a ccdB constitutive gene in 
pDEST170/I instead of an internally deleted Ntail [26]. 
In that case, there would be no PGR2 background as 
clones bearing the ccdB gene are not viable. 

Before concluding, we would like to point out that the 
method described here is proposed as a toolbox made 
of three independent parts: (i) the production of epPGR 
libraries, (ii) the "two halves" construction of a Gate- 
way® plasmid, (Hi) a sub-cloning strategy. The Nxail" 
XD project provided an opportunity to describe a com- 
bined usage of the three parts, but these can be used 
separately (B. Goutard, unpublished results). 

The Gateway" cloning technology has been a tremen- 
dous breakthrough since it was implemented in the 
early 2000s. Perhaps not by chance, its commercial 
availability happened to coincide with the beginning of 
the Structural Genomics era. This coincidence undoubt- 
edly helped Structural Genomics programs to comply 
with their time schedule [1]. To our surprise, we have 
been unable to find numerous directed evolution pro- 
jects that made use of the Gateway® technology. We 
hope that the modified protocol described in this study 
will contribute to a broader usage of this powerful tech- 
nology in studies aimed at evolving proteins. 

Conclusions 

We have described a method to create high complexity 
epPRG libraries using only the LR reaction of the Gate- 
way" cloning technology. Directly cloning an epPGR 
product in the plasmid used for screening the library 
allows eliminating not only the BP reaction but also the 
associated E. coli cell transformation and plasmid purifi- 
cation required by the usual Gateway® procedure. Thus, 
the same result can be obtained with half the amount of 
work. 

Methods 

Materials 

Turbo broth (TB) was from AthenaES™. NucleoSpin® 
Extract 11 and NucleoSpin® Plasmid kit (Macherey- 
Nagel) were used for purifying PGR products and for 
plasmid mini-preparations, respectively. DNA ligations 
were performed using Quick Ligation Kit (Roche). Gen- 
eMorph® II Random Mutagenesis Kit (Stratagene) was 
used for epPGR experiments. Preparative and analytical 
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PCRs were carried out using Pfx and Taq polymerase 
(Invitrogen), respectively. All primers were purchased 
from Operon. DNA sequencing was performed by 
GATC Biotech. 

DNA constructs 

pMRBAD-link-CGFP [19], encoding the C-terminal half 
(aa 158-238) of eGFP under the control of the Pbad 
promoter (inducible by L-arabinose) and allowing 
upstream in frame cloning of DNA fragments, was a 
kind gift of Lynne Regan, as were pETlla-link-NGFP, 
pETlla-Z-NGFP and pMRBAD-Z-CGFP. These latter 
two encode fusion proteins between leucine zippers and 
the two halves of the eGFP, while pETlla-link-NGFP 
encodes the N-terminal half (aa 1-157) of eGFP under 
the control of the T7 promoter (inducible by IPTG) and 
allows downstream in-frame cloning of DNA fragments 
[19]. pMRBAD-XD-CGFP was constructed as follows. 
The coding sequence of XD was PGR amplified using 
primers Ncol-MeV.XD and MeV.XD-Aatll (Table 6) 
and a plasmid encoding the X domain (aa 459-507) of 
the measles virus phosphoprotein (strain Edmonston B) 
(pDEST14/XDhc) as template [9]. DpnI-treated purified 
PGR product and pMRBAD-link-CGFP were ligated 
after Ncol and Aatll digestion. pDONR-NxAiL was 
obtained by BP reaction with a PGR Nxail fragment 
amplified using primers NtailF and NtailR (Table 6), 
and a plasmid encoding the measles virus nucleoprotein 
(strain Edmonston B) (pET-21a/N) as template [27]. 
pDONR-Stop-NxAiL was obtained by BP reaction with a 
PGR Ntail fragment amplified using primers StopNtail 
and NtailR (Table 6) and pDONR-NxAiL as template. 
StopNtail has two Stop codons before the first Ntail 
codon. pNGG-NxAiL and pNGG-Stop-NxAiL were 
obtained by LR reaction using pNGG as destination vec- 
tor and pDONR-NxAiL or pDONR-Stop-NxAiL as donor 
constructs, respectively. pDEST170/I-NxAiL was 
obtained by LR reaction using pDONR-NxAiL and 
pDEST170/I [24]. The constructs were verified by 
sequencing and found to conform to expectations. The 
plasmids used in this study are summarized in Table 2. 

Strains and electroporation 

DB3.1 cells (Invitrogen) were used to propagate non 
recombined Gateway® plasmids, and TAMl cells (Active 
Motif) to propagate non-Gateway® and recombined 
Gateway® plasmids. T7 cells (New England Biolabs) 
bearing the pLysS plasmid from Rosetta(DE3)pLysS cell 
(Novagen) (referred to as T7pRos) were used for protein 
expression and for epPGR library screening. 

Electrocompetent T7pRos cells were prepared as fol- 
lows. Frozen cells that had been previously transformed 
with pMRBAD-XD-GGFP were used to seed 1 L of LB 
containing 50 |ig/ml kanamycin and 34 [ig/ml 



chloramphenicol, and allowed to grow until ODgoo = 
0.5. The cells were recovered by centrifugation for 10 
min at 5000 g, and resuspended in 400 ml of ice-cold 
water. Three additional washings were performed using 
300 and then 200 ml of ice-cold water. The cells were 
then washed with 80 ml of ice-cold 10% glycerol, and 
finally re-suspended in 4 ml of the same buffer. Aliquots 
of 1 ml were stored frozen at -80°C until use. For elec- 
troporation, 50 [il of cells were mixed with 1 \il of DNA 
and electroporated in 1 mm wide cuvettes (Eurogentec 
GE-0001-50) at 1660 V using an Eppendorf electropora- 
tor 2510. Under these conditions, an average time con- 
stant of -5.4 milliseconds was observed. 

Error-prone PCR and Gateway® cloning 

Error-prone PGR experiments were performed following 
the indications provided by the GeneMorph* II Random 
Mutagenesis Kit instruction manual (Stratagene) using 
10 ng of template in 50 |il of PGR mix. At the end of 
the PGR, 1 \il (20 U) of Dpnl (New-England Biolabs) 
was added to the PGR mix which was then incubated 
for 1 h at 37°G. After purification, the quality and quan- 
tity of the PGR product were estimated as described in 
the kit manual and by spectrophotometry. 

LR reactions were performed overnight at 26°G in a 
dry incubator in a final volume of 5 |il containing 1.5 ^1 
of each of the donor and acceptor construct, 1 |il of 5x 
LR buffer, and 1 |il of LR clonase enzyme mix (Invitro- 
gen). The next day, 1 [^1 of a 2 ^g/|il proteinase K solu- 
tion (Euromedex, EU0090) was added and the reaction 
mix was incubated for 15 min at 37°G. LR reactions 
were diluted to 50 \xl with water, and 1 \il of this dilu- 
tion was used to electroporate T7pRos cells as described 
above. Immediately after electroporation, the cells were 
resuspended in 15 ml of SOG medium, and then incu- 
bated for 1 hour at 37°G under 200 rpm shaking. Serial 
dilutions of an aliquot were plated on AKGplate to 
assess the complexity of the library. The remaining bac- 
terial suspension was supplemented with ampicillin, 
kanamycin and chloramphenicol at a final concentration 
of 100, 50 and 34 [ig/ml, respectively, and grown at 37° 
G under 200 rpm shaking. When the culture medium 
became turbid (~5 h), it was supplemented with glycerol 
at a final concentration of 20% (volume/volume), and 
300 [il aliquots were frozen at -80°G. 

Library screening 

An aliquot of the library was thawed and serial dilu- 
tions were spread on AKGplates to assess the number 
of clones. An aliquot from the same tube was then 
spread on AKGplates at a cell density allowing isolated 
colonies to be obtained after overnight growth at 37°G. 
Golonies were randomly picked up and individually 
grown at 37°G overnight under 700 rpm shaking in 500 
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[il of TB containing 100 l^g/ml ampicillin, 50 \ig/ml 
kanamycin and 34 |Jg/ml chloramphenicol (TBAKC) in 
a 96-welI deep-well plate. The next day, another 96- 
well deep-well plate containing the same volume of the 
same culture medium was seeded with 50 |il of the 
overnight culture, and the remaining pre-culture was 
used to individually seed 96-well ampicillin agar plates 
provided by GATC Biotech and sent to the same com- 
pany for plasmid purification and sequencing. The 
freshly seeded deep-well plate was grown for 1 h at 37° 
C under 700 rpm shaking. IPTG and arabinose were 
then added at final concentrations of 0.5 mM and 2%, 
respectively, and protein expression was allowed to 
proceed at 17°C for at least 20 h. Culture medium, cul- 
ture temperature during protein expression, and IPTG 
and arabinose concentrations leading to the best sig- 
nal-to-noise ratio were determined by using a frac- 
tional factorial approach modified from [28], the detail 
of which are provided in Additional file 5: Figure S3. 
At the end of the culture, the deep-well plate was spun 
for 3 min at 1,500 g and the culture medium discarded. 
The cell pellets were re-suspended in 500 [il of PBS by 
shaking at 700 rpm for 20 min at 17°C. The fluores- 
cence of 100 1^1 of re-suspended cells was measured 
using a TECAN GENios Plus spectrofluorimeter. The 
cell density was determined by measuring the optical 
density at 600 nm (ODgoo) of 100 |il of re-suspended 
cells diluted 10 times with PBS, using the same spec- 
trofluorimeter. The results were expressed as the fluor- 
escence to ODgoo ratio. 

Protein expression and purification 

T7pRos cells were extemporaneously co-transformed by 
heat-shock with the following pairs of constructs: 
pMRBAD-XD-CGFP and either pNGG-Ntail (positive 
control N) or pNGG-Stop-NxAiL (negative control S) or 
pNGG-NxAiL constructs bearing the mutated Nxail 
sequences of interest (clones 1-4); pMRBAD-Z-CGFP 
and pETlla-Z-NGFP (positive fluorescence control Z). 
In all cases, transformants were selected on AKCplates. 
TBAKC (4 ml per well in a 24-well deep-well plate) was 
seeded with a single colony from AKCplates, and then 
incubated overnight at 37°C under shaking. The next 
day, 4 ml of TBAKC in 24-well deep-well plate were 
seeded with 200 \il of the overnight culture and shaken 
at 37°C at 200 rpm until the medium became turbid 
(from -0.5 to ~1 ODgoo)- IPTG and arabinose were 
added at final concentrations of 0.5 mM and 2%, respec- 
tively, and the cultures were incubated overnight at 17°C 
under 200 rpm agitation. Cells were then recovered by 
spinning the deep-well plate for 5 min at 3000 g. Each 
cell pellet was re-suspended in 1 ml of 50 mM Tris/HCl 
pH 8, 0.3 M NaCl, 0.1% Triton XlOO, 1 mM EDTA, 10 
mM imidazole, 1 mM PMSF and 0.25 mg/ml lysozyme 



and frozen. After thawing, DNAsel and MgS04 were 
added at final concentrations of 10 [ig/ml and 20 mM, 
respectively and incubated at 37°C in a shaking incuba- 
tor for 30 min. The deep-well plate was spun for 10 
min at 3,000 g at 4°C. The supernatant (soluble fraction) 
was transferred to 1.5 ml microtubes and supplemented 
with 50 1^1 of a 50% (volume/volume) suspension of 
IMAC sepharose high performance (GE healthcare). The 
mixture was rotated for one hour on a wheel at 4°C. 
Sepharose beads were then washed three times with 1 
ml of 50 mM Tris/HCl pH 8, 0.3 M NaCl, 50 mM imi- 
dazole, and bound His-tagged proteins were eluted with 
100 \il of 50 mM Tris/HCl pH 8, 0.3 M NaCl, 500 mM 
imidazole. When purifying His-tagged proteins under 
denaturing conditions, the soluble and insoluble frac- 
tions were not separated by centrifugation after DNAsel 
treatment but the total lysate was directly supplemented 
with two volumes of 50 mM Tris/HCl pH 8, 8 M guani- 
dinium chloride, 0.3 M NaCl, 10 mM imidazole, and 
with 100 1^1 of a 50% (volume/volume) suspension of 
IMAC sepharose beads. Subsequent steps were carried 
out as described for the non denaturing conditions, 
except that the experiment was performed at room tem- 
perature, and that washing and elution buffers were sup- 
plemented with 8 M urea. In all cases, eluted proteins 
were analyzed by SDS-PAGE. 

Additional material 



Abbreviations 

Aa: Amino-acid; ACplates: LB agar 100 mm plates containing 100 |ig/ml 
ampicillin and 34 (jg/ml chloramphenicol; AKCplates: LB agar 100 mm plates 
containing 100 ng/ml ampicillin 34 |jg/ml chloramphenicol and 50 (jg/ml 
kanamycin; IPTG: Isopropyl P-D-l-thiogalactopyranoside; SDS-PAGE: Sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis; TB: Turbo broth. 
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